Assamese Text to Speech Corpus

Assamese Text to Speech Corpus

0 reviews requests (0)
Catalogue Number: 1514
Stock In Stock

OverView

Assamese Text to Speech Corpus 44:49:34 hours | 28.85 GB | 32,594 Audio Segments | 2 Speakers The LDC-IL Assamese Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual...
Please Login to see the price

Dataset Description

Assamese Text to Speech Corpus 

44:49:34 hours | 28.85 GB | 32,594 Audio Segments | 2 Speakers 


The LDC-IL Assamese Text to Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer in Assamese script. This dataset spans a duration of 44:49:34 (hh:mm:ss) , consisting of read speech in the studio setup. The data is derived from 01 female and 01 male native Assamese speakers. A comprehensive explanation of dataset can be found in the Assamese Text to Speech Documentation. 


For any research-based citations, please use the following citations: 

  1. Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Assamese Text to Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-45-3. 
  2. Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.

Item specifics

  • Authors Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan.
  • Corpus Type Text to Speech Corpus
  • Catalogue Number 1514
  • ISBN 978-93-48633-45-3
  • Data Source On Field
  • Duration 44:49:34 hours
  • # of Audio Segments 32594
  • Release Date 3/20/2025
  • Terms and Conditions General instructions for use of the resources provided by LDC-IL.
Commercial User
Non-Commercial User
LDC-IL Raw Text Corpora: An Overview
LDC-IL Raw Speech Corpora: An Overview

Write a review

Please login or register to review